Structural Modeling of Fundamental Frequency Contour for Thai Expressive Speech
Abstract
Problem statement: Appropriate modeling of the fundamental frequency (F0) contour of speech is a key factor in preserving the quality of speech prosody. One successful approach was developed for Mandarin Chinese, a tonal language. It is based on the assumption that the behavior of vocal-fold elongation during vibration can be approximated by that of a simple forced vibrating system. This approach was therefore applied to model Thai expressive speech with a best-fit function.
Approach: The F0 contours of Thai expressive speech utterances were modeled structurally, approximating them by the response of a simple forced vibrating system. Modeling the F0 contours of Thai expressive speech is important for speech analysis and can make speech communication more interesting and effective. The speech database consists of male and female speech, each covering four speech styles: angry, sad, enjoyable and reading. Five sentences are used for each speech style, and each sentence includes 100 samples. An F0 contour is extracted from each speech sample, and a set of structural modeling parameters is then extracted from each contour. The parameters are then used to synthesize the F0 contour, and the synthesized contour is compared with that of natural speech by calculating the RMS error.
Results: The experimental analysis shows that the RMS error differs from one speech style to another, which indicates that the structural model responds differently to each speech style. The reading style has the smallest error of all styles.
Conclusion: The findings provide evidence supporting the use of this modeling approach in speech synthesis systems with good preservation of speech prosody.
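The comparison step described in the abstract, an RMS error between a natural and a synthesized F0 contour, can be sketched as follows. This is a minimal illustration, not the authors' implementation. The function name `f0_rms_error`, the use of Hz as the unit, the frame-by-frame alignment, and the convention that unvoiced frames are marked with 0 are all assumptions, since the abstract does not specify them.

```python
import math


def f0_rms_error(natural_hz, synthesized_hz):
    """RMS error between two frame-aligned F0 contours, over voiced frames only.

    Assumption: both contours have one value per frame, in Hz, and a value
    of 0 (or below) marks an unvoiced frame that should be skipped.
    """
    if len(natural_hz) != len(synthesized_hz):
        raise ValueError("contours must have the same number of frames")
    # Keep only frames that are voiced in both contours.
    pairs = [(n, s) for n, s in zip(natural_hz, synthesized_hz) if n > 0 and s > 0]
    if not pairs:
        raise ValueError("no voiced frames to compare")
    squared_sum = sum((n - s) ** 2 for n, s in pairs)
    return math.sqrt(squared_sum / len(pairs))


# Hypothetical five-frame contours; the third frame is unvoiced.
natural = [120.0, 125.0, 0.0, 130.0, 128.0]
synthesized = [118.0, 126.0, 0.0, 133.0, 128.0]
print(f0_rms_error(natural, synthesized))
```

Averaging this value over all samples of one speech style would give a per-style error of the kind the Results compare, assuming the errors are aggregated per style.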
Related articles
Modeling of Fundamental Frequency Contour of Thai Expressive Speech using Fujisaki’s Model and Structural Model
Problem statement: In spontaneous speech communication, prosody is an important factor that must be taken into account, since prosody affects not only the naturalness but also the intelligibility of speech. Focusing on the synthesis of Thai expressive speech, a number of systems have been developed over the years. However, the expressive speech with various speaking styles has not been accomplishe...
Thai Expressive Speech Processing Technology: A Review
Problem statement: Studies on Thai expressive speech, or emotional speech, have been conducted for years. Most of them aim to analyze the characteristics of Thai expressive speech. However, no conclusive review of these studies has been conducted to support further study of the speech technology or applications of Thai expressive speech. Approach: The review of research on Thai expre...
Structural Modeling of Fundamental Frequency contour for Thai Tones
Problem statement: In Thai, tone is an essential feature of a prosodic syllable that identifies the meaning of that syllable or that part of a word. To generate tonal speech with natural prosody, the fundamental frequency (F0) of the speech must be managed appropriately. A successful structural modeling approach from Mandarin Chinese has been adapted to model Thai tones. Approach: The s...
Analytical Study on Fundamental Frequency Contours of Thai Expressive Speech Using Fujisaki’s Model
Problem statement: In spontaneous speech communication, prosody is an important factor that must be taken into account, since prosody affects not only the naturalness but also the intelligibility of speech. Focusing on the synthesis of Thai expressive speech, a number of systems have been developed over the years. However, the expressive speech with various speaking styles has not been accomplishe...
Analytical Study of Fujisaki’s Model of Fundamental Frequency Contour for Thai Tones
Problem statement: The tone of a tonal language is an important feature of a prosodic syllable that identifies the meaning of that syllable or that part of a word. It is crucial to model the tone-related features of speech to achieve the greatest naturalness in speech communication. Approach: The study presents an approach to analyze the model parameters of Thai tones for two genders. The successive ...